Word Length Frequency and Distribution in English: Observations, Theory and Implications for the Construction of Verse Lines

Authors

Hideaki Aoyama

John Constable

Abstract

Recent observations in the theory of verse and empirical metrics have suggested that constructing a verse line involves a pattern-matching search through a source text, and that the number of found elements (complete words totaling a specified number of syllables) is given by dividing the total number of words by the mean number of syllables per word in the source text. This paper makes this latter point explicit mathematically, and in the course of this demonstration shows that the word length frequency totals in English output are distributed geometrically (previous researchers reported an adjusted Poisson distribution), and that the sequential distribution is random at the global level, with significant non-randomness in the fine structure. Data from a corpus of just under two million words and a syllable-count lexicon of 71,000 word-forms are reported. The pattern-matching theory is shown to be internally coherent, and it is observed that some of the analytic techniques described here form a satisfactory test for regular (isometric) lineation in a text.
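The counting claim in the abstract can be illustrated with a small simulation. The sketch below is not the authors' code or data: it assumes word lengths are drawn independently from a geometric distribution with an illustrative parameter `p = 0.7`, and the helper names `count_runs` and `geometric_length` are hypothetical. It counts the word-start positions from which a run of complete words totals exactly `n` syllables, and compares that count with the total number of words divided by the mean number of syllables per word.

```python
import random
from itertools import accumulate


def count_runs(syllables, n):
    """Count word starts from which complete words total exactly n syllables."""
    # Syllable offsets of every word boundary: 0, s1, s1+s2, ...
    boundaries = [0] + list(accumulate(syllables))
    boundary_set = set(boundaries)
    # A run starting at boundary b matches when b + n is also a word boundary.
    return sum(1 for b in boundaries[:-1] if b + n in boundary_set)


def geometric_length(rng, p):
    """Draw a word length k >= 1 with P(k) = p * (1 - p) ** (k - 1)."""
    k = 1
    while rng.random() > p:
        k += 1
    return k


rng = random.Random(0)   # fixed seed so the run is repeatable
p = 0.7                  # assumed parameter, chosen only for illustration
words = [geometric_length(rng, p) for _ in range(200_000)]

mean_syllables = sum(words) / len(words)
found = count_runs(words, 10)              # runs totaling exactly 10 syllables
predicted = len(words) / mean_syllables    # total words / mean syllables per word

print(found, round(predicted))
```

Under these assumptions the two printed numbers should agree closely, because in an independent geometric sequence any given syllable offset is a word boundary with probability `p`, which is the reciprocal of the mean word length. Real text, which the abstract describes as non-random in its fine structure, may deviate from this idealized case.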


Similar resources

Aug 1998 Word Length Frequency and Distribution in English: Observations, Theory, and Implications for the Construction of Verse Lines Hideaki


Aug 1998 Word Length Frequency and Distribution in English: Observations, Theory, and Implications for the Construction of Verse


Isometric Lineation in English Texts: An Empirical and Mathematical Examination of its Character and Consequences

In this paper we build on earlier observations and theory regarding word length frequency and sequential distribution to develop a mathematical characterization of some of the language features distinguishing isometrically lineated text from unlineated text, in other words the features distinguishing isometrical verse from prose. It is shown that the frequency of Qn of n syllables making comple...


Do We Need Discipline-Specific Academic Word Lists? Linguistics Academic Word List (LAWL)

This corpus-based study aimed to explore the most frequently used academic words in linguistics and to compare the resulting word list with the distribution of high-frequency words in Coxhead’s Academic Word List (AWL) and West’s General Service List (GSL), in order to examine their coverage within the linguistics corpus. To this end, a corpus of 700 linguistics research articles (LRAC), consisting of approximately ...


High- and Mid-Frequency Vocabulary Size as Predictors of Iranian University EFL Students’ Speaking Performance

The literature is replete with studies focusing on the role of vocabulary knowledge in second language receptive skills. However, research on the relationship between aspects of vocabulary knowledge and productive skills in general, and speaking performance in particular, has remained scant. This paper examined the relationship between knowledge of L2 vocabulary size at di...



Journal title:

CoRR

Volume: cmp-lg/9808004

Publication date: 1998
